Automatically Discovering Word Senses
نویسندگان
چکیده
We will demonstrate the output of a distributional clustering algorithm called Clustering by Committee that automatically discovers word senses from text1.
منابع مشابه
Discovering Corpus-Specific Word Senses
This paper presents an unsupervised algorithm which automatically discovers word senses from text. The algorithm is based on a graph model representing words and relationships between them. Sense clusters are iteratively computed by clustering the local graph of similar words around an ambiguous word. Discrimination against previously extracted sense clusters enables us to discover new senses. ...
متن کاملDiscovering Word Senses for Polysemous Words Using Feature Domain Similarity
This paper presents a new clustering algorithm called DSCBC which is designed to automatically discover word senses for polysemous words. DSCBC is an extension of CBC (Pantel and Lin, 2002), and incorporates feature domain similarity: the similarity between the features themselves, obtained a priori from sources external to the dataset. By incorporating the feature domain similarity in clusteri...
متن کاملWord Sense Induction Using Lexical Chain based Hypergraph Model
Word Sense Induction is a task of automatically finding word senses from large scale texts. It is generally considered as an unsupervised clustering problem. This paper introduces a hypergraph model in which nodes represent instances of contexts where a target word occurs and hyperedges represent higher-order semantic relatedness among instances. A lexical chain based method is used for discove...
متن کاملSussx: WSD using Automatically Acquired Predominant Senses
We introduced a method for discovering the predominant sense of words automatically using raw (unlabelled) text in (McCarthy et al., 2004) and participated with this system in SENSEVAL3. Since then, we worked on further developing ideas to improve upon the base method. In the current paper we target two areas where we believe there is potential for improvement. In the first one we address the f...
متن کاملAutomatic Multimedia Knowledge Discovery, Summarization and Evaluation
This paper presents novel methods for automatically discovering, summarizing and evaluating multimedia knowledge from annotated images in the form of images clusters, word senses and relationships among them, among others. These are essential for applications to intelligently, efficiently and coherently deal with multimedia. The proposed methods include automatic techniques (1) for constructing...
متن کامل